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(54) Title: REAGENTS AND A METHOD FOR SATURATION LABELLING OF PROTEINS 

(57) Abstract: A matched set of fluorescent dyes is provided, wherein each dye of the set is capable ofcovalent attachment to a 
protein and wherein each of the dyes has a molecular structure and a charge that is matched one with the other, such that relative 
electrophoretic mobility of a protein labelled with one dye of the set is the same as the electrophoretic mobility of the protein labelled 
with a different dye of the set. The matched set comprises at least two different fluorescent dyes of formula: wherein n is 1, 2, or 3; 
Zl and Z2 independently represent the carbon atoms necessary to complete a phenyl or naphthyl ring system; one of groups Rl and 
R2 is a target bonding group; remaining group Rl or R2 is selected from -(CH2)4-W or -(CH2)r-H; group R3 is hydrogen, except 
when either Rl or R2 is -(CH2)r-H, in which case R3 is W; and W is selected from sulphonic acid and sulphonate. The invention also 
provides a method for saturation labelling of a protein with a fluorescent dye so as to label all available target amino acid, suitably 
cysteine, residues in the protein, thereby giving a single population of labelled protein molecules. 
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Reagents and a Method for Saturation Labelling of Proteins 



The present invention relates to fluorescent dyes and to a method 
for labelling complex protein samples to enable differential protein 
5 analysis. 



2D Difference Gel Electrophoresis (DIGE) uses matched, spectrally- 
resolved fluorescent dyes to label protein samples prior to 2-dimensional 
(2D) electrophoretic separation (Minden, J. et al, Electrophoresis, (1997), 

10 1 8 , 2071). This fluorescent multiplexing approach overcomes many of 
the disadvantages of traditional 2D electrophoresis. Fluorescent pre- 
labelling of protein samples allows multiple samples to be run on the same 
gel, enabling quantitative differences between the samples to be easily 
identified by overlaying the fluorescent images. To allow fluorescent 

15 multiplexing, the protein samples must be labelled equally with the dyes 
and the migration of the labelled proteins in the gel must be positionally 
matched, to allow quantitative differences to be detected. 



WO 96/33406 (Minden, J. et al) discloses a method to detect 
20 differences between different cell samples using matched, spectrally 

resolved dyes to label protein in the samples prior to 2D electrophoresis. 
Described are dyes that are matched for molecular mass and charge to 
give equivalent migration. The approach employs cyanine dyes having an 
N-hydroxysuccinimidyl (NHS) ester reactive group to label amines. The 
25 NHS ester group is targeted at the e-amino of lysine residues in proteins 
and results in covalent labelling. Three different dyes (Cy2™, Cy3 and 
Cy5) are derivatised to enable multiplexing. These dye molecules have a 
molecular weight of approximately 500 Daltons and are matched in mass 
to give equivalent migration of the labelled protein. The dyes rely on the 
30 intrinsic charge of the cyanine fluor to compensate for the loss of a lysine 
positive charge on conjugation of the dye to lysine. WO 96/33406 



CONMMMATION COPY 
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describes the use of these dyes in e minimal labelling strategy to .abel 
between f and 2% of available attachment sites. To achieve equivalent 
migration, the dye pairs were matched for mass by compensating for the 
difference in the linker length between the indole rings of the cyanrne by 
, modulation of the size of the aliphatic chain attached to an indole nitrogen 
by 2 carbon units. Two or more spectrally resolved dyes that mee, these 
criteria represent a matched minimal labelling dye set. Equivalent 
migration of the matched lysine minimal labelling dye set was 
demonstrated using propyl Cy3 and methyl Cy5 to label proteins followed 
,0 by their 2D separation and overlaying the resulting images to show 
positional matching of the two labelled samples. 

Lysine residues are highly abundant in proteins, ensuring that this 
labelling strategy will represent all the proteins present in a complex 

15 sample. However, the typical lysine content of a protein is 7% and, ,f 
every lysine in a protein were labelled, it would result in a large mass shrft 
due to the dya. Thus, the strategy employed is to -minimally" label the 
protein to ensure .hat only - 1 in 5 protein molecules are labelled, thereby 
giving a statistical probability that each labelled molecule has only one 

20 dya attached. This creates a spot pattern on 2D tha, is very similar to 
silver stained images, but gives greater sensitivity and dynamic range and 
the ability to multiplex. 

A typical 2D gel analysis involves "picking" protein spots of interest 
25 from the gel for identification by MALDI-MS. The minimal labelling 

approach results in 2 spots per protein (to. labelled and unlabel.ed), with 
the majority of protein in the unlabel.ed spot. The unlabel.ed spot must 
be .ocated to recover sufficient protein to enable identification by MALDI- 
MS This requires an additional staining step prior to spot picking. In 
30 order to facilitate protein spot picking directly from fluorescent gels, a 
labelling strategy is required that gives a single spot per protein. 
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The labelling strategy employed in the present invention aims to 
saturate the protein with dye in order to label as many target residues as 
possible and to produce a single labelled spot per protein isoform. Thus, 
a cyanine dye molecule is coupled to all available target amino acid 

5 residues on a protein, thereby giving a single population of labelled protein 
molecules with a similar number of dye molecules attached. According to 
the present invention, saturation labelling is achieved using dye sets 
targeted at cysteine residues, which contain a thiol group. Cysteine 
residues are present in 95% of proteins, but there are fewer cysteine 

10 residues in each protein than lysine. This means that all protein molecules 
can be labelled, but each protein molecule will have fewer labelled 
residues than if lysine residues were labelled to saturation. This results in 
increased sensitivity of detection as the proportion of labelled protein is 
higher than with minimal labelling, but ensures that the proteins remain 

15 soluble. 



The increase in mass due to the addition of dye (the mass shift) 
with saturation labelling of cysteine residues will be larger than that 
employing a minimal labelling strategy of lysine residues. However, the 

20 increase in mass is less than would be observed if a saturation labelling 
approach was employed on lysine residues. The extent of the mass shift 
due to the dye addition will vary for each protein depending on the 
individual protein cysteine content (typically -2%) and the availability of 
the cysteine residues to the dye under the conditions of labelling. This 

25 results in a 2D spot pattern that is very different from published silver 
stained 2D protein maps. 



The present invention therefore provides fluorescent reagents and a 
method for reproducibly labelling, all of the available cysteine residues 
30 that are accessible in a mixture of cysteine-containing proteins. The 
cyanine dye derivatives according to the present invention provide 
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valuable sets of fluorescent labels, each having a common core structure 
and which are particularly useful for multiplex analysis. 

Accordingly, in a first aspect, the present invention provides a 
5 matched set of fluorescent dyes comprising at least two different 
fluorescent dyes having the formula (I): 



wherein n is different for each said dye and is 1 , 2, or 3; 
15 Z 1 and Z 2 independently represent the carbon atoms necessary to 
complete a phenyl or naphthyl ring system; 
one of groups R 1 and R 2 is the group: 



20 

where Y is a target bonding group; 

remaining group R 1 or R 2 is selected from -(CH2)4-W or _(CH2)r-H; 
group R 3 is hydrogen, except when either R 1 or R 2 is -(CH 2 )r-H, in which 
case R 3 is W; 

25 W is selected from sulphonic acid and sulphonate; 
p is an integer from 3 to 6; 
q is selected to be 2 or 3; and 
r is an integer from 1 to 5; 
and their salts; 

30 characterised in that when n of two of said dyes differs by + 1 , one of p, 
q and r of said two dyes differs by -1 . 



10 




o 

— (CH2) p C-NH — (CH2) q — Y 
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According to the present invention a matched set of fluorescent 
dyes is provided, wherein each dye of the set is capable of covalent 
attachment to a protein and wherein each of the dyes has a molecular 

5 structure and a charge that is matched one with the other, such that 
relative elect rophoretic mobility of a protein labelled with one dye of the 
set is the same or substantially the same as the elect rophoretic mobility 
of the protein labelled with a different dye of the same set. In one 
embodiment according to the first aspect, the matched set of dyes 

10 comprises at least two different fluorescent dyes, wherein each dye in 
said set is a compound having the formula (I) and wherein Z\ Z 2 , R\ R 2 , 
R 3 , Y, W, n, p, q and r are hereinbefore defined. Suitably, the different 
dyes in the matched set of fluorescent dyes are spectrally resolvable to 
enable different samples labelled with such dyes to be distinguished one 

15 from the other. Suitably, at least two dyes of the matched set of dyes 
have a structure according to formula (I) and may be selected from the 
trimethine cyanine dye class (in which n= 1), the pentamethine cyanine 
dye class (in which n = 2) and the heptamethine cyanine dyes (in which 
n= 3). 



Suitably, the matched set of fluorescent dyes according to the 
invention comprises at least two different fluorescent dyes having the 
formula (II): 



20 



25 




30 



(II) 

wherein n is different for each said dye and is 1 , 2, or 3; 
one of groups R 1 and R 2 is the group: 
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II 

— (CH2) p C-NH — (Chyq — Y 



where Y is a target bonding group; 
5 remaining group R 1 or R 2 is selected from -(CH 2 ) 4 -W or ~(CH 2 )r-H; 

group R 3 is hydrogen, except when either R 1 or R 2 is -(CH 2 )r^H, in which 
case R 3 is W; 

W is selected from sulphonic acid and sulphonate; 
p is an integer from 3 to 6; 
10 q is selected to be 2 or 3; and 
r is an integer from 1 to 5; 
and their salts; 

characterised in that when n of two of said dyes differs by + 1 , one of p, 
q and r of said two dyes differs by -1 . 

15 

Preferably, the matched set of fluorescent dyes comprises at least 
two different dyes according to formula (I) or (II) in which: 
n is selected to be 1 or 2; 
p is selected to be 4 or 5; 
20 q is selected to be 2 or 3; and 
r is selected to be 1, 2 or 3. 



Preferably, the target bonding group Y in each dye of the matched 
set of fluorescent dyes is the same and is selected from a maleimido 
25 group and an iodoacetamido group, A particularly preferred target 
bonding group for each dye is a maleimido group. 



30 



Particularly preferred are dye sets according to formula (I) or (II) 
that are selected from the trimethine cyanine class of dyes and the 
pentamethine cyanine class of dyes. Such dyes are described as Cy3™ 
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and Cy5 dyes. Absorbance and emission data for the cyanine dyes are 
shown below in Table 1 . 



Table 1 



Dye 


Fluorescence Colour 


Abs (nm) 


Em (nm) 


Cy3 


Orange 


550 


570 


Cy5 


Far red 


649 


670 


Cy7 


Near IR 


747 


774 



5 

Suitably, salts of the fluorescent dyes according to formula (I) or (II) 
may be selected from K + , Na + , NH4 + , RsNhT and FUN* where R is Ci to 
C4 alkyl. 



10 In an alternative embodiment, the matched set of dyes may 

optionally include one or more additional dyes, provided that each such 
additional dye possesses charge and mass characteristics, such that the 
electrophoretic mobility of a protein labelled with the dye is the same or 
substantially the same as the electrophoretic mobility of a protein labelled 

15 with a dye according to formula (I) or (II). Other such additional dyes may 
include benzoxazole-containing dyes. An example of an additional dye is 
Cy2. 



Exemplary sets containing pairs of fluorescent dyes according to 
20 formula (I) or (II) that are matched in electrophoretic mobility when 
coupled to proteins are as follows: 



Set 1 



25 



1-{6-{[2-{2,5-dioxo-2,5-dihydro-l/y-pyrrol-l-yl)ethyl]amino}-6-oxohexyl)-2- 
[(1 E,3£)-3-(1 -ethyl-3,3-dimethyl-5-su!pho-1 ,3-dihydro-2A/-indol-2- 
ylidene)prop-1-enyl]-3,3-dimethyl-3tf-indolium (Compound I); and 
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1 -(6-{[2-(2,5-dioxo-2,5-dihydro-1 /Y-pyrrol-1 -yl)ethyl]amino}-6-oxohexyl)- 
S^-dimethyi^-td^a^Sa-S-d^^-trimethyl-S-sulpho-l^-dihydro^/y- 
indol-2-ylidene)penta-1,3-dienyl]-3/y-indolium (Compound II); 



1 -{6-{[2-(2,5-dioxo-2,5-dihydro-1 /7-pyrrol-1 -yl)ethyl]amino}-6-oxohexyl)-2- 
[(1E,3£)-3-(1-propyl-3,3-dimethyl-5-sulpho-1,3-dihydro-2/y-indol-2- 
ylidene)prop-1-enyl]-3,3-dimethyl-3A/-indolium (Compound III); and 
10 1 -(6-{[2-(2,5-dioxo-2,5-dihydro-1 /V-pyrrol-1 -yl)ethyl]amino}-6-oxohexyl)- 
3,3-dimethyl-2-[(1 £,3£,5a-5-(1 -ethyl-S.S-trimethyl-S-sulpho-l ,3-dihydro- 
2#-indol-2-ylidene)penta-1 ,3-dienyl]-3/V-indolium (Compound IV); 



1 -(6-{[2-(2,5-dioxo-2,5-dihydro-1 /y-pyrrol-1 -yl)ethyl]amino}-6-oxohexyl)-2- 
[(1£,3£)-3-(1-ethyl-3,3-dimethyl-5-sulpho-1,3-dihydro-2A/-indol-2- 
ylidene)prop-1-enyl]-3,3-dimethyl-3/y-indolium (Compound I); and 
1 -(5-{[2-(2,5-dioxo-2,5-dihydro-1 W-pyrrol-1 -yl)ethyl]amino}-6-oxopentyl)- 
20 3,3-dimethyl-2-[(1£,3£,5£)-5-(1-ethyl-3,3-trimethyl-5-sulpho-1,3-dihydro- 
2/y-indol-2-ylidene)penta-1 ,3-dienyl]-3/y-indolium (Compound V); 



25 1 -(6-{[2-(2, 5-dioxo-2,5-dihydro-1 /V-pyrrol-l -yl)ethyl]amino}-6-oxohexyl)-2- 
[(1£,3a-3-(3,3-dimethyl(1-sulpho-butyl)-1,3-dihydro-2/y-indol-2- 
ylidene)prop-1-enyl]-3,3-dimethyl-3/y-indolium (Compound VI); and 
1 -(5-{[2-(2,5-dioxo-2,5-dihydro-1 //-pyrrol-1 -yl)ethyl]amino}-6-oxopentyl)- 
3,3-dimethyl-2-[(1£,3£,5£)-5-(3,3-dimethyl-(1-sulpho-butyl)-1,3-dihydro- 

30 2/y-indol-2-ylidene)penta-l,3-dienyl]-3//-indolium (Compound VII). 



5 



Set 2 



Set 3 



Set 4 
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1- (6-{[3-(2 / 5-dioxo-2 / 5-dihydro-1/y-pyrrol-1-yl)propyl]amino}-6-oxohexyl)- 

2- [(1 £,3£)-3-(1 -ethyl-3 # 3-dimethyl-5-sulpho-1 ,3-dihydro-2//-indol-2- 
5 ylidene)prop-1-enyl]-3,3<limethyl-3/y-indolium (Compound VIII); and 

1 -(6-{[2-(2,5-dioxo-2,5-dihydro-1 A/-pyrrol-1 -yl)ethyl]amino}-6-oxohexyl)- 
3,3-dimethyl-2-[(1£,3E,5£)-5-(1-eth^^ 

2/y-indol-2-ylidene)penta-1 / 3-dienyl]-3/y-indolium (Compound IV); and 
10 Set 6 

1 -(6-{[3-(2,5-dioxo-2,5-dihydro-1 tt-pyrroI-1 -yl)propyl]amino}-6-oxohexyl)- 
2-[<1F,3£)-3-(3,3-dimethyl(1-sulpho^^ 

ylidene)prop-1-enyl]-3,3<iimethyl-3A/-indolium (Compound IX); and 
15 1 -(6-{[2-(2,5-dioxo-2,5-dihydro-1 //-pyrrol-1 -yl)ethyl]amino}-6-oxohexyI)- 
3,3-dimethyl-2-[(1£,3£,5£)-5-(3,3^ime^^ 

2//-indol-2-ylidene)penta-1 ,3<lienyl]-3//-indoIium (Compound X). 

The present invention therefore provides a matched set of reagents 
20 for reproducibly labelling all cysteine residues accessible in a mixture of 
cysteine-containing proteins under the conditions used. Equivalent 
migration in the pi dimension is achieved using dye sets with an overall 
neutral charge in order to match the neutral charge on the thiol group to 
which the dyes are conjugated. Neutral cyanine dyes may be obtained by 
25 means of a sulphonic acid or sulphonate group substituent covalently 
attached to the dye structure. Moreover, it has been discovered that 
equivalent mass migration is not simply achieved by matching the 
molecular weight of the dye sets, but rather, requires careful manipulation 
of the overall size and mass of the dye in order to achieve positional 
30 matching of proteins labelled with such dyes. In order to obtain 

equivalent migration employing a cysteine saturation labelling approach, 
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I 



dye sets are required wherein each of the matched dyes according to 
formula (I) differs in mass by a single carbon unit. The dye sets may be 
obtained by varying different substituents attached to the dye 
chromophore. Two or more spectrally resolved dyes that meet these 
5 criteria represent a matched saturation dye set. 

The present invention also provides a method for saturation 
labelling of a protein with a fluorescent dye so as to label all available 
target amino acid residues in the protein, thereby giving essentially a 
10 single population of labelled protein molecules. Suitably, the target amino 
acid is a cysteine residue. By the term "available" it is meant amino acid 
residues that are accessible to the fluorescent dye for reaction. Available 
cysteine residues must be accessible, and in a reduced (i.e. in a free thiol) 
form. 

15 

Thus, in a second aspect, there is provided a method for labelling a 
mixture of proteins in a sample wherein each of said proteins contains one 
or more cysteine residues, said method comprising: 

i) adding to an aqueous liquid containing said sample a fluorescent 
20 dye wherein said dye contains a target bonding group that is covalently 

reactive with said proteins; and 

ii) reacting said dye with said proteins so that said dye labels said 
proteins; 

characterised in that all available cysteine residues in said proteins are 
25 labelled with said dye. 

Preferably, the fluorescent dye according to the method of the 
second aspect is a cyanine dye. Particularly preferred cyanine dyes for 
use in the method are those containing a sulphonic acid or sulphonate 
30 group, for example, the dyes according to formula (I). 
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Preferably, the target bonding group is selected from a maleimido 
group and an iodoacetamido group. 

In a preferred embodiment according to the second aspect, the 
method further comprises prior to step i), the step of treating the protein 
with a reductant. 

Preferably, the cyanine dye is used in a range of 5 to 200nmol of 
dye per 50|ig of protein. 

Preferably, the method for labelling a mixture of proteins in a 
sample according to the second aspect is performed at a pH in the range 
6.0 to 9.0. 



The invention also provides a method for labelling and thereby 
imparting fluorescent properties to a sample, suitably a protein sample, 
using a dye according to formula (I). Thus, in a third aspect, there is 
provided a method for labelling one or more proteins in a sample, the 
method comprising: 

i) adding to a liquid sample containing said one or more proteins a 
fluorescent dye of formula (I): 




(0 

wherein n is different for each said dye and is 1 , 2, or 3; 
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Z 1 and Z 2 independently represent the carbon atoms necessary to 
complete a phenyl or naphthyl ring system; 
one of groups R 1 and R 2 is the group: 



where Y is a target bonding group; 

remaining group R 1 or R 2 is selected from -(CH 2 )4-W or -(CH 2 )r-H; 
group R 3 is hydrogen, except when either R 1 or R 2 is -(CH 2 )r-H, in which 
case R 3 is W; 

W is selected from sulphonic acid and sulphonate; 
p is an integer from 3 to 6; 
q is selected to be 2 or 3; and 
r is an integer from 1 to 5; 
and their salts; 

characterised in that when n of two of said dyes differs by + 1 , one of p, 
q and r of said two dyes differs by -1 ; and 

incubating said dye with said sample under conditions suitable for 
labelling said one or more proteins. 

Preferably, each of Z 1 and Z 2 represents the carbon atoms 
necessary to complete a phenyl ring system. 

Preferably, the method according to the third aspect employs a 
fluorescent dye of formula (I) in which: 
n is selected to be 1 or 2; 
p is selected to be 4 or 5; 
q is selected to be 2 or 3; and 
r is selected to be 1 , 2 or 3. 



O 

— (CH2) p C-NH (CH2)q — Y 
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Preferably, the target bonding group Y is selected from a maleimido 
group and an iodoacetamido group. 

To perform the method for saturation labelling of cysteine residues 
5 in proteins, a number of reaction parameters must be considered, 
including: 

i) Accessibility of Protein Thiol Groups 

10 In native folded proteins many cysteine residues are involved in the 

secondary structure of proteins by disulphide bond formation and may 
also be buried within the core of the molecule. In order to label these 
residues the protein must be unfolded to enable accessibility of the dyes, 
using protein denaturants such as urea, and they must be reduced to 

15 generate the free thiol. The choice of reductant is important to enable 

efficient reduction of the protein. Commonly used thiol-containing protein 
reductants are dithiothreitol (DTT) and (3-mercaptoethanol. Suitable 
phosphine reductants include tributyl phosphine (TBP) and tris(2- 
carboxyethyl) phosphine (TCEP). Sodium borohydride may also be used. 

20 TCEP is a preferred reductant to give efficient reduction, whilst minimising 
reaction with the dye. 

The efficiency of reduction is influenced by the concentration of 
reductant (relative to the cysteine content of protein sample), the 

25 temperature of reaction (which influences protein denaturation) and the 
duration of the reaction. If the reductant is immobilised on beads this 
may also affect the efficiency. The optimal reductant concentration for a 
particular sample type will vary depending on the reductant used and the 
cysteine content of the sample. However, 50jag of protein typically 

30 requires 10nmol of TCEP for reduction (assuming average cysteine 

content at 2%). For mammalian samples with higher cysteine content, 
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then 20nmol of TCEP is typically used. It is also important to maintain 
the ratio of 1nmoI TCEP to 2nmol dye to give optimal labelling. 

The optimum combination of these factors varies depending on the 
5 individual protein structure and cysteine content. Higher reduction 
temperature will give increased labelling, as a result of increased 
denaturation of the protein and thus increased accessibility of cysteines. 
However, high temperatures may have an adverse effect on the protein. 
Urea exists in equilibrium with carbamylate, but at higher temperatures 
10 (generally >40°C) the equilibrium shifts in favour of the carbamylate. 
Carbamylate is a more chemically reactive species than urea and can 
attack primary amines (eg. N-terminal amino group and s-amino group of 
lysines), leading to artefactual charge heterogeneity and the generation of 
charge trains on 2D gels. Saturation labelling of cysteine residues at 37°C 
15 in the presence of urea does not exhibit carbamylation effects. 



ii) Dye Concentration 



The hydrophobic nature of the cyanine dyes requires the use of 
20 organic solvents (eg. 10% DMF) to aid solubility. To achieve saturation 
of the available target residues so that the labelling goes to completion it 
is necessary to use high concentrations of dye for labelling, whilst 
maintaining dye solubility. The use of a sulphonate group on the cyanine 
dyes helps to maintain solubility at high dye concentrations. The optimal 
25 dye concentration in the labelling reaction for a particular sample type will 
vary depending on the cysteine content of the sample and whether there 
are any other components in the sample which might interfere with 
labelling. However, the dye should always be present at least in excess 
of the thiol content to achieve saturation labelling and it is also important 
30 to maintain the ratio of 1 nmol TCEP to 2nmol dye to give optimal 

labelling. Using the dye in excess would typically require the dye to be in 
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the range 5 to 200nmol per 50|ig of protein sample. However, 50|ag of 
protein typically requires 20nmol of dye for a labelling reaction (assuming 
average cysteine content of 2%). For mammalian samples with higher 
cysteine content, 40nmol of dye is typically used. However, other 
components in the sample may also react with the dye. For example, 
liver samples may contain elevated levels of the tripeptide glutathione, in 
response to stress/toxic substances. As a consequence, increased dye 
concentrations may be required in order to maintain labelling efficiency. 
Serum samples containing elevated levels of albumin, which is a highly 
abundant protein with a high cysteine content, may also require increased 
dye concentrations. 

iii) pH of Labelling Reaction 

Cysteine residues are strong nucleophiles and may be labelled in 
proteins using reagents having iodoacetamide and maleimide reactive 
groups. A particularly preferred reactive group is the maleimido group. 
The pKa of a thiol group is critical in determining its reactivity. Above the 
corresponding pH, its nucleophilicity increases markedly as the thiolate 
(S~) form replaces the protonated SH species. In an isolated cysteine 
molecule, the thiol group has a pKa of approximately 8.6; however, the 
microenvironment of the thiol within the protein can also effect its pKa. 
The rate of reaction of thiol groups with maleimides increases at alkaline 
pH, due to the increased concentration of thiolate anion. Under alkaline 
conditions, hydrolysis of the maleimide to maleamic acid becomes a 
significant side reaction, and competing reactions with other functional 
groups such as lysine and histidine become more significant. 



The e-amino group of lysine behaves as a typical amine with a pKa 
in proteins of 9.0 - 9.5. At a lower pH, e.g. pH 6.5, the majority of 
amine groups are predominantly in the protonated, unreactive NH3 + form 
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and the reaction with maleimide groups is -1000 fold slower than with 
thiols. Terminal amine groups have pKa values of 7.5 - 8 depending on 
the amino acid concerned. The terminal amine will be present in the un- 
protonated (reactive) form at lower pH than those of lysine. Thus, 
reaction of maleimide groups with amines requires a higher pH than the 
reaction with thiol groups. Consequently, labelling of thiol groups with 
maleimides is more specific at lower pH, with less potential to label lysine 
residues. Thus, the pH of the reaction is also a critical factor in achieving 
saturation labelling and for specific labelling of thiol groups. Preferably, 
saturation labelling using fluorescent dyes, for example according to 
formula (I) is performed at a pH range of 6.0 to 9.0, most preferably a pH 
of 8.0, being a compromise between the presence of the reactive thiolate 
and the presence of reactive amines. 

Protein samples for comparison may be derived from a variety of 
cell sources, including all normal and transformed cells derived from any 
recognised source with respect to species (eg. human, rodent, simian), 
tissue source (eg. brain, liver, lung, heart, kidney skin, muscle) and cell 
type (eg. epithelial, endothelial). There are established protocols available 
for the culture of diverse cell types. (See for example, Freshney, R.I., 
Culture of Animal Cells: A Manual of Basic Technique, 2 nd Edition, Alan 
R.Liss Inc. 1987). This invention may also be used to compare samples 
from plants using intact plants or cultured plant cells. In addition, 
samples for use in the method of the invention may be derived from cells 
which have been transfected with recombinant genes and cultured, cells 
which have been subjected to an external stimulus (such as heat shock or 
drug treatment) or other biological fluids (such as cerebrospinal fluid or 
serum). The present invention may also be used to target subsets of 
proteins present in a cell prior to labelling; for example, by isolating 
particular fractions such as low molecular weight proteins or pi range; 
sub-cellular compartments such as nuclear proteins. 



WO 2004/005933 ^fcCT/GB2002/003142 

-17- 



Those skilled in the art will recognise that protein samples may be 
extracted from such samples using a variety of methods and extraction 
reagents. Typically, cells from the tissue/culture are disrupted, for 

5 example by homogenisation, sonication, cell lysis, and the protein 

extracted and solubilised in the presence of reagents including denaturing 
reagents, such as urea, thiourea, detergents such as SDS, CHAPS, Triton 
X-100, NP-40, reducing agents, such as dithiothreitol (DTT), 
mercaptoethanol, and buffer such as Tris, Hepes, Protease inhibitors , 

10 such as phenylmethanesulphonyl fluoride (PMSF), 

ethylenediaminetetraacetic acid (EDTA), leupeptin, aprotinin, may also be 
added to minimise degradation by endogenous proteases. 

The matched set of fluorescent dyes and the labelling method may 

15 be used for detecting differences in the protein content of at least two 

different protein samples, for example, according to the method described 
by Minden, J. et al (Electrophoresis, (1997), 1_8, 2071). In a typical 
example of the method, protein is extracted from each of 2 different 
samples by known methods as described above and the protein 

20 concentration determined. To a 50jig aliquot of each protein extract is 
added 10nmol of TCEP (tris-(2-carboxyethyI)phosphine) and this is 
incubated for one hour at 37°C. To each of these reduced protein 
samples is added 20nmol of dye and incubated for 30 minutes at 37°C. 
The first protein sample is labelled with the first dye (for example Cy3) of 

25 a matched set of dyes and the second protein sample is labelled with the 
second dye of the matched set of dyes (for example Cy5). The reaction 
is stopped by the addition of sample buffer containing DTT. The labelled 
protein samples are mixed to give a single sample for separation. The 
samples are electrophoretically separated, preferably by 2D-PAGE. The 

30 procedures for electrophoretic separation are well known to those skilled 
in the art. 
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In a fourth aspect of the invention, a kit is provided, said kit 
comprising a matched set of fluorescent dyes comprising at least two 
different fluorescent dyes having the formula (I): 



10 



15 



20 



25 




(I) 

wherein n is different for each said dye and is 1 , 2, or 3; 

Z 1 and Z 2 independently represent the carbon atoms necessary to 

complete a phenyl or naphthyl ring system; 

one of groups R 1 and R 2 is the group: 

O 

— (CH2) p C-NH (CH2) q Y 

where Y is a target bonding group; 

remaining group R 1 or R 2 is selected from -(CH 2 )4-W or _(CH 2 )r-H; 
group R 3 is hydrogen, except when either R 1 or R 2 is -(CH 2 ) r -H, in which 
case R 3 is W; 

W is selected from sulphonic acid and sulphonate; 
p is an integer from 3 to 6; 
q is selected to be 2 or 3; and 
r is an integer from 1 to 5; 
and their salts; 

characterised in that when n of two of said dyes differs by + 1 , one of p, 
q and r of said two dyes differs by -1 . 
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The invention is further illustrated by reference to the following 
examples and figures in which: 

Figure 1 illustrates a typical workflow for differential analysis of protein 
5 samples using cysteine reactive dyes with 2D electrophoresis. Two 
different protein samples are prepared from cells, tissues or other 
biological fluids. The first and second protein samples may be any two 
samples in which it is desired to compare the protein content, for 
example, from a normal and diseased tissue, or from control versus drug 
10 treated/stimulated cells. The method may also be applied to one 
dimensional separation systems. 

Figure 2 shows overlay images of proteins labelled with dye sets 1,12 
and 9 and separated by 2D electrophoresis with outlines of labelled 
15 protein spots to demonstrate the positional matching. 

Examples 

Example 1 . Synthesis of Dyes 

20 

i) General Experimental Procedures 

1H NMR (8h) spectra were recorded on a Jeol JNM-LA300 FT NMR 
spectrometer. Chemical shifts are reported in 8 (ppm). Samples were 
25 prepared as solutions in a suitable deuterated solvent such as d4- 

methanol. UV/VIS spectroscopy was conducted using the Unicam UV3 
UV/VIS spectrometer. Trimethoxypropene was purchased from Karl 
Industries Inc., Ohio, USA. All other chemicals were purchased from 
Sigma-Aldrich Company Limited, Dorset, England. 
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N-BOC-Aminoethylmaleimide and N-BOC-aminopropylmaleimide 
were prepared according to the literature description in J. Org. Chem., 
(1995), 60, 5352-5355. 

ii) Potassium 2,3,3-trimethylindolenine-5-sulphonic acid 

Hydrazinobenzenesulphonic acid (20. Og) was dissolved in acetic 
acid (60ml) and 3-methyl-2-butanone (26. Og) added then heated at reflux 
for 3 hours. The desired compound was precipitated by cooling in the 
fridge with scratching and the off white slurry was diluted with propan-2- 
ol and filtered (71 %). 

The 2,3,3-trimethyl-5-sulphonyl-indoIenine (16.45g) was dissolved 
in methanol (160ml) with heating and a saturated solution of KOH in 
propan-2-oI (100ml) was added. The solution changed to a yellow colour 
and a solid formed. The solution was cooled and the solid was filtered to 
form an off-white solid (15.9g, 98%). 5h (300MHz, CDaOD) 7.84 (m, 2H), 
7.46 (d, 1H), 3.30 (s, 3H) and 1.35 (s, 6H). 

iii) 1 ,2,3,3-Tetramethyl-5-sulphonyl-indolium iodide 

Potassium 2,3 / 3-trimethylindolenine-5-sulphonic acid (1.0g, 
3.61 mmol) and iodomethane (0.25ml, 3.97mmol) were mixed with 
dichlorobenzene (10ml) under a nitrogen atmosphere. The solution was 
heated at 100°C using a sand bath for 4 hours. A solid had begun to 
form but analysis by tic (30% MeOH/70% DCM) showed product 
formation was not complete so an additional equivalent of iodomethane 
was added and the reaction heated for an additional 2 hours before 
cooling to room temperature. The solid was collected by filtration, 
washed with dichlorobenzene, diethyl ether then dried in vacuo to afford 
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a purple solid (0.899, 98%). 5h (300MHz, CDsOD) 8.06 (m, 1H), 7.94 
(dd, 1H), 7.84 (m, 1H), 4.02 (s, 3H) and 1.61 (s, 6H). 

iv) 1 -Ethyl-2, 3,3-trimethyl-5-sulphonyHndolium iodide 



Potassium 2,3,3-trimethylindolenine-5-sulphonic acid {10. Og, 
41.97mmol) and iodoethane (4.0ml, 50.35mmol) were mixed with 
dichlorobenzene (40ml) under a nitrogen atmosphere. The solution was 
heated at 1 20°C using a sand bath for 16 hours producing a purple solid. 
10 The solid was collected by filtration then washed with dichlorobenzene, 
chloroform and ether to produce pale pink solid, (10. 2g, 91 %). 8h 
(300MHz, CDaOD) 7.98 (m, 3H), 4.55 (q, 2H), 1.56 (s, 6H) and 1.48 (t, 
3H). 

15 v) 1 -Propyl-2,3,3-trimethyl-5-sulphonyl-indolium iodide 

Potassium 2,3,3-trimethylindoIenine-5-sulphonic acid (1.0g, 
3.61 mmol) and iodopropane (0.4ml, 3.97mmoI) were mixed with 
dichlorobenzene (10ml) under a nitrogen atmosphere. The solution was 
20 heated at 100°C using a sand bath for 20 hours producing a red-brown 
colour gelatinous solid. The solid was collected by filtration and then 
washed with dichlorobenzene, chloroform and diethyl ether to afford a 
pink solid, (472mg, 47%). 8h (300MHz, CDaOD) 7.91 (m, 3H), 4.50 (t, 
2H), 2.01 (dt, 2H), 1.56 (s, 6H) and 1.13 (t, 3H). 

25 

vi) 1 -Sulphobutyl-2,3,3-trimethvl-5-sulphonyl-indolium iodide 

2,3,3-Trimethylindolenine (2.0g, 12.6mmol) and 1 ,4-butanesultone 
(1.7g, 12.6mmol) were mixed together and heated at 100°C for 6 hours. 
30 The solution gradually turned red and after the 6 hours the reaction was 
cooled to room temperature. The solid was dispersed in diethyl ether and 
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filtered. The solid was collected and dried. 8h (300MHz, CDaOD) 7.91 (m, 
1H), 7.75 (m, 1H), 7.42 (m, 2H), 4.58 (m, 2H), 2.93 (m, 2H), 2.18 (m, 
2H), 1.94 (m, 2H) and 1.61 (s, 6H). 



5 vii) 1 -(5-Carboxypentyl)-2 / 3 / 3-trimethvl-5-sulphonyl-indolium iodide 

2,3,3-Trimethylindolenine (6.4g, 40mmol) was dissolved in 
dichlorobenzene (25ml) and stirred until the solution was homogenous. 
To this was added 6-bromohexanoic acid (15.6g, 80mmol) and the 

10 reaction heated to 1 10°C in a sand bath for 6.5 hours. The reaction was 
allowed to cool to room temperature where the sides of the flask were 
scratched then the flask was placed in the fridge for 1 hour. After this 
time, a beige solid had formed in the purple solution so the solid was 
collected by filtration then washed with dichlorobenzene and ether to 

15 afford a beige solid (7.42g, 52%). 5h (300MHz, CD3OD) 7.91 (m, 1H), 
7.78 (m, 1H), 7.62 (m, 2H), 4.52 (t, 2H), 2.38 (t, 2H), 2.04 (p, 2H), 
1.88-2.45 (m, 4H) and 1.61 (s, 6H). 

viii) 1-(4-Carboxybutyl)-2 / 3 / 3-trimethyl-5-sulphonyl-indolium iodide 

20 

2,3,3-Trimethylindolenine (1 1 .3g, 40mmol) was dissolved in 
dichlorobenzene (30ml) and stirred until the solution was homogenous. 
To this was added 5-bromobutanoic acid (19.3g, 106.6mmol) and the 
reaction heated to 1 10°C in a sand bath for 6.5 hours. The reaction was 

25 allowed to cool to room temperature where the sides of the flask were 
scratched then the flask was placed in the fridge for 1 hour. After this 
time a beige solid had formed in the purple solution so the solid was 
collected by filtration then washed with dichlorobenzene and ether to 
afford a beige solid (18.0g, 75%). 8h (300MHz, CD3OD) 7.90 (m, 1H), 

30 7.78 (m, 1H), 7.62 (m, 2H), 4.59 (t, 2H), 2.41 <t, 2H), 2.04 (m, 2H), 
1.76 (m, 2H) and 1.61 (s, 6H). 
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ix). Aminoethylmaleimide and Aminopropylmaleimide 

To a solution of 4M hydrochloric acid in dioxane was added the 
5 appropriate BOC-aminoalkylmaleimide and the reaction stirred at room 
temperature for 30 minutes. After this time the reaction had deposited a 
solid and the solvents were removed in vacuo to reveal a white fluffy 
solid. The compound was used crude in subsequent steps. 
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10 x) 1-(6-{[2-(2 / 5-Dioxo-2 / 5-dihydro-1/y-pyrrol-1-yl)ethyl]amino)-6- 
oxohexyl)-2-[(1E,3£)-3-(1-ethyl-3 y 3-dimethyl"5-sulpho-1 ,3-dihydro-2/y- 
indol-2-ylidene)prop-1 -enyl]-3, 3-dimethyl-3/Y-indolium (Compound I) 

1-Ethyl-2,3,3-trimethyl-5~sulphonyl-indolium iodide (2.0g, 
15 7.48mmoI), N,N'-diphenylformamidine (1.5g, 7.48mmol) and 

triethylorthoformate (1 .1g, 7.48mmol) were dissolved in ethanol (10ml) 
then heated at reflux (100°C) for 3 hours. A solid formed on the sides of 
the reaction flask and UV/VIS showed a new peak at 408nm. Diethyl 
ether was added and the precipitate and the solid collected by filtration, 
20 washed with ether and dried in vacuo to afford a yellow/orange solid 
(1.83g, 66%). UV/VIS (MeOH); absorption Xmax = 408nm. 



To a solution of the Cy3 half-dye (1 .83g, 6.22mmol) in anhydrous 
pyridine (10ml) was added acetic anhydride (1 .0ml) and the reaction 

25 stirred under a nitrogen atmosphere for 10 minutes. After this time 1-(5- 
carboxypentyl)-2,3,3-trimethyI indolium bromide (2.2g, 6.22mmol) was 
added and the reaction stirred at room temperature for 1 6 hours. The 
progress of the reaction was monitored by tic (20% MeOH/80% DCM). 
The solvent was removed under reduced pressure and purified using flash 

30 column chromatography (reversed phase silica: water-50% methanol 
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gradient) to yield 299mg of the desired Cy3 acid product (9%). UV/VIS 
(MeOH); absorption Am ax — 550nm. 

Cy3 acid (200mg / 0.36mmol) was dissolved in anhydrous DMF 
5 under a nitrogen atmosphere then stirred at room temperature. DIPEA 
(80jal, 0.40mmol) and TSTU (120mg, 0.40mmol) were added and the 
reaction stirred for 2 hours until deemed complete to the NHS ester by tic 
(20% MeOH/80% DCM). The NHS ester was treated with a second 
equivalent of DIPEA (80p.l, 0.40mmol) and aminoethylmaleimide (80mg, 
10 0.40mmol) were added and the reaction allowed to stir for 2 hours. Thin 
layer chromatography (Tic) showed conversion to a new product so the 
reaction was diluted with diethyl ether (50ml) and the solvents decanted 
to leave a pink residue. Flash column chromatography (silica: DCM-40% 
methanol gradient) afforded the desired maleimide product (98mg, 34%). 
15 UV/VIS (MeOH); absorption Xma* = 550nm. 6h (300MHz, CDsOD) 8.56 (t, 
1H), 7.92 <m, 2H), 7.62-7.38 (m, 5H), 6.72 (s, 2H), 6.50 (dd, 2H), 4.19 
(m, 4H), 3.52 (m, 2H), 2.15 (t, 2H), 1 .94-1 .56 (m, 6H), 1 .75 (s, 1 2H) 
and 1.44 (t, 3H). 

20 xi) 1 -(6-([2-(2,5-Dioxo-2,5-dihydro-1 //-pyrrol-1 -yl)ethyl]amino)-6- 
oxohexyl)-3 / 3-dimethyl-2-[(1£ / 3£ / 5£)-5-(1 / 3 y 3-trimethyl-5-sulpho-1 ,3- 
dihydro-2A/-indol-2-ylidene)penta-1 ,3-dienyl]-3rt-indolium (Compound II) 



25 1 1 .90mmol) was suspended in a mixture of acetic acid (40ml) and TFA 
(2ml, 18.0mmol) until all of the solid dissolved. 1 ,3,3-Trimethoxypropene 
(12.5ml, 95.0mmol) was added to the reaction and stirred at room 
temperature for 5 hours. The solution was pipetted into 500ml of diethyl 
ether and the precipitate collected by filtration (4.80g, contains salts). 



1 ^^^-Tetramethyl-S-sulphonyl-indolium iodide (5.00g, 
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To a solution of the Cy5 half-dye (4.80g, 14.9mmol) in methanol 
(40ml) was added potassium acetate (3.00g, 34.2mmol) and 1-(5- 
carboxypentyl)-2,3,3-trimethyl indolium bromide (3.00g, 16.4mmol). 
After stirring overnight, the solution was pipetted into diethyl ether 
5 (500ml) and the blue solid collected by filtration and dried in vacuo. 
Purification was achieved by flash column chromatography (reversed 
phase silica: water-50% methanol gradient) to yield 2.30g of desired 
product (28%). UV/VIS (MeOH); absorption A,max = 642nm. 

10 Cy5 acid (200mg, 0.36mmol) was dissolved in anhydrous DMF 

under a nitrogen atmosphere then stirred at room temperature. DIPEA 
(80pJ, 0.40mmol) and TSTU (120mg, 0.40mmol) were added and the 
reaction stirred for 2 hours until deemed complete to the NHS ester by tic 
(20% MeOH/80% DCM). The NHS ester was treated with a second 

15 equivalent of DIPEA (80jj,I, 0.40mmol) and aminoethylmaleimide (80mg, 
0.40mmol) were added and the reaction allowed to stir for 2 hours. Tic 
showed conversion to a new product so the reaction was diluted with 
diethyl ether (50ml) and the solvents decanted to leave a blue residue. 
Flash column chromatography (silica: DCM-40% methanol gradient) 

20 afforded the desired maleimide product (105mg, 43%). UV/VIS (MeOH); 
absorption kmax = 644nm. 8h (300MHz, CDaOD) 8.18 (dt, 2H), 7.82 (m, 
2H), 7.56-7.24 (m, 5H), 6.80 (s, 2H), 6.64 (t, 1H), 6.45 (d, 1H), 6.24 (d, 
1H), 4.17 (t, 2H), 3.60 (s, 3H), 3.40 (t, 2H), 2.09 (t, 2H), 1.89-1.48 (m, 
6H) and 1.79 (s, 12H). 

25 

xii) 1 -(6-([2-(2,5-Dioxo-2,5-dihvdro-1 rt-pyrrol-1 -yl)ethyllamino)-6- 
oxohexyD^-tdE^a-S-d-propyl-S^-dimethyl-S-sulpho-l ,3-dihydro-2/7- 
indol-2-ylidene)prop-1 -enyl]-3,3-dimethyl-3/V-indolium (Compound III) 

30 1 -(S-CarboxypentyD^^^-trimethyl-S-sulphonyl-indolium (e.Og, 

16.94mmol) and N.N'-diphenylformamidine (e.eSg, 33.87mmol were 
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dissolved in acetic acid (20ml) then heated at reflux (120°C) for 4 hours. 
The reaction was allowed to cool to room temperature then the solvents 
removed in vacuo. The orange oil was dissolved in chloroform and 
washed with water then dried with magnesium sulphate and concentrated 
to an orange oil of the Cy3 half-dye. This oil was used without further 
purification. 

To a solution of the Cy3 half-dye (1 .00g, 2.1 8mmol) in anhydrous 
pyridine (10ml) was added acetic anhydride (1.0ml) and the reaction 
stirred under a nitrogen atmosphere for 10 minutes. After this, 1-propyl- 
2,3,3-trimethyl-5-sulphonyl-indoIium iodide (0.92g, 3.28mmol) was added 
and the reaction stirred at room temperature for 1 6 hours. The progress 
of the reaction was monitored by tic (20% MeOH/80% DCM). The 
solvent was removed under reduced pressure and purified using flash 
column chromatography (silica: DCM-40% methanol gradient) to yield 
241 mg of the desired Cy3 acid product (21 %). UV/VIS (MeOH); 
absorption A. max — 550nm. 

Cy3 acid (100mg, 0.18mmol) was dissolved in anhydrous DMF 
under a nitrogen atmosphere then stirred at room temperature. DIPEA 
(30jal, 0.20mmol) and TSTU (54mg / 0.20mmol) were added and the 
reaction stirred for 2 hours until deemed complete to the NHS ester by tic 
(20% MeOH/80% DCM). The NHS ester was treated with a second 
equivalent of DIPEA (30(il, 0.20mmol) and aminoethylmaleimide (40mg, 
0.20mmol) were added and the reaction allowed to stir for 2 hours. Tic 
showed conversion to a new product so the reaction was diluted with 
diethyl ether (50ml) and the solvents decanted to leave a pink residue. 
Flash column chromatography (silica: DCM-40% methanol gradient) 
afforded the desired maleimide product (41 mg, 34%). UV/VIS (MeOH); 
absorption 7U» = 550nm. 5h (300MHz, CD 3 OD) 8.52 (t, 1H), 7.90 (m, 
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2H), 7.60-7.39 (m, 5H), 6.78 (s, 2H), 6.52 (dd, 2H), 4.22 (m, 4H), 3.56 
(m, 2H), 2.13 (t, 2H), 1.97-1.56 (m, 8H), 1.75 (s, 12H) and 1.09 (t, 3H). 



xiii) 1-(6-{[2-(2 / 5-Dioxo-2 y 5-dihydro-1^pyrrol-1-yl)ethyl]amino)-6- 
5 oxohexyl)-3,3-dimethyl-2-[(1^ 

1 ,3-dihydro-2A/-indol-2-ylidene)penta-1 ,3-dienyl]-3/7-indolium (Compound 
IV) 



1-(5-Carboxypentyl)-2 / 3 / 3-trimethyl-5-sulphonyl-indolium (4.0g, 
10 14.59mmol) and malonaldehyde bis(phenylimine) hydrochloride (7.55g, 
29.18mmol) were dissolved in acetic acid (20ml) then heated at reflux 
(120°C) for 4 hours. The reaction was allowed to cool to room 
temperature then the solvents removed in vacuo. The red oil was 
dissolved in chloroform and washed with water then dried with 
15 magnesium sulphate and concentrated to a red oil of the Cy5 half-dye. 
This oil was used without further purification. 



To a solution of the Cy5 half-dye (1.00g, 2.07mmol) in anhydrous 
pyridine (10ml) was added acetic anhydride (1 .0ml) and the reaction 

20 stirred under a nitrogen atmosphere for 10 minutes. To this 1-ethyl- 

2,3,3-trimethyl-5-sulphonyl-indolium iodide (0.59g, 2.28mmol) was added 
and the reaction stirred at room temperature for 1 6 hours. The progress 
of the reaction was monitored by tic (20% MeOH/80% DCM). The 
solvent was removed under reduced pressure and purified using flash 

25 column chromatography (silica: DCM-40% methanol gradient) to yield 
365mg of the desired Cy5 acid product (31 %). UV/VIS (MeOH); 
absorption A,max = 644nm. 



30 



Cy5 acid (52mg, 0.09mmol) was dissolved in anhydrous DMF 
under a nitrogen atmosphere then stirred at room temperature. DIPEA 
(10^1, O.IOmmol) and TSTU (28mg, O.IOmmol) were added and the 
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reaction stirred for 2 hours until deemed complete to the NHS ester by tic 
(20% MeOH/80% DCM). The NHS ester was treated with a second 
equivalent of DIPEA (10pJ, O.IOmmol) and aminoethylmaleimide (18mg, 
O.IOmmol) were added and the reaction allowed to stir for 2 hours. Tic 

5 showed conversion to a new product so the reaction was diluted with 
diethyl ether (30ml) and the solvents decanted to leave a blue residue. 
Flash column chromatography (silica: DCM-40% methanol gradient) 
afforded the desired maleimide product (29mg, 47%). UV/VIS (MeOH); 
absorption W = 644nm. 8h (300MHz, CDsOD) 8.38 (m, 2H), 7.88 (m, 

10 2H), 7.55-7.24 (m, 5H), 6.80 (s, 2H), 6.71 (t, 1H), 6.42 (d, 1H), 6.24 (d, 
1H), 4.11 (m, 4H), 3.54 (t, 2H), 2.12 (t, 2H), 1.92-1.36 (m, 9H) and 
1.81 (s, 12H). 

xiv) 1-(5-{[2-(2 / 5-Dioxo-2 / 5-dihvdro-1/y-pyrrol-1-yl)ethyl]amino)-6- 
15 oxopentyl)-3 y 3-dimethyl-2-[(1£,3£" / 5£)-5-(1-ethyl-3,3-trimethyl-5-sulpho- 
1 ,3-dihydro-2rt-indol-2-ylidene)penta-1 ,3-dienyl]-3M-indolium (Compound 
V) 



1-(4-Carboxybutyl)-2,3,3-trimethyI indolium bromide (2.00 g, 5.88 
20 mmol) and malonaldehyde bis(phenylimine) hydrochloride (2.28g, 

8.82mmol) were dissolved in acetic acid (30ml) then heated at 1 20°C for 
6 hours. The reaction was then allowed to cool before the acetic acid 
was removed in vacuo afford a mobile oil. This was dissolved in 
chloroform and washed with water, dried with magnesium sulphate, 
25 filtered and concentrated in vacuo to afford a more viscous oil. Most of 
the unreacted malonaldehyde bis(phenylimine) remained at the interface 
of the chloroform/ water. The compound was purified using flash column 
chromatography (dichloromethane-30% methanol gradient) to afford a red 
solid (120mg, 5%). 



30 



WO 2004/005933 



m 



T/GB2002/003142 



-29- 



To a solution of the Cy5 half-dye (120mg, 0.31 mmol) in anhydrous 
pyridine (3ml) was added acetic anhydride (0.5ml) and the reaction stirred 
under a nitrogen atmosphere for 10 minutes. After this time f the 1-ethyl- 
2,3,3-trimethyl-5-sulphonyl-indoIium iodide (83mg, 0.31 mmol) was added 
5 and the reaction stirred at room temperature overnight. The solvent was 
removed under reduced pressure and purified using flash column 
chromatography (silica: dichloromethane-40% methanol gradient) to yield 
84 mg of desired product (48%). UV/VIS (MeOH); absorption Xmax = 
644nm. 



Cy5 (84mg / 0.15mmol) was dissolved in anhydrous DMF under a 
nitrogen atmosphere then stirred at room temperature. DIPEA (26)11, 
0.15mmol) and TSTU (45mg, 0.15mmol) were added and the reaction 
stirred for 2 hours until deemed complete to the NHS ester by tic (20% 

15 MeOH/80% DCM). The NHS ester was treated with a second equivalent 
of DIPEA (DIPEA (26jal, 0.15mmol) and then aminoethylmaleimide (52mg / 
0.30mmol) was added and the reaction allowed to stir for an additional 2 
hours. Tic showed conversion to a new product, so the reaction was 
diluted with diethyl ether (50ml) and the solvents decanted to leave a pink 

20 residue. Flash column chromatography (silica: DCM-40% methanol 

gradient) afforded the desired maleimide product (30 mg, 30%). UV/VIS 
(MeOH); absorption JU« = 644nm. 5h (300MHz, CD 3 OD) 8.15 (q, 2H), 
7.80 (m, 2H), 7.52-7.31 (m, 5H), 6.75 (s, 2H), 6.64 (t, 1H), 6.46 (d, 
1H), 6.32 (d, 1H), 4.27 (m, 4H), 4.15 (t, 2H), 2.09 (t, 2H), 1.79 (s, 6H), 

25 1 .65 (s, 6H) 2.56-2.44 (m, 4H) and 1 .12 (t, 3H). 
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xv) 1 -(6-{[2-(2,5-Dioxo-2,5-dihydro-1 //-pyrrol- 1 -yl)ethyl]amino)-6- 
oxohexvl)-2-[(1f,3£)-3-(3,3-dimethyl(1-suto^ 
2-ylidene)prop-1-enyl]-3,3-dimethyl-3//-indolium (Compound VI) 



1.41mmol), N,N'-diphenylformamidine (278mg / 1.41mmol) and 1- 
sulphobutyI-2,3,3-trimethylindolium iodide (418mg, 1.41mmol) were 
dissolved in anhydrous pyridine (5ml) and stirred at room temperature. 
Acetic anhydride (0.4ml) was then added and the reaction stirred 
10 overnight at ambient temperature. The pyridine was removed in vacuo 
and the magenta oil purified by flash column chromatography 
(dichloromethane-1 0% methanol gradient) to afford a pink solid (67mg f 
8%). 

15 The Cy3 acid (50mg, 0.09mmol) was dissolved in anhydrous DMF 

under a nitrogen atmosphere then stirred at room temperature. DIPEA 
(20jjJ, 0.09mmol) and TSTU (59mg, 0.18mmol) were added and the 
reaction stirred for 2 hours until deemed complete to the NHS ester by tic 
(20% MeOH/80% DCM). The NHS ester was treated with a second 

20 equivalent of DIPEA (20|il, 0.09mmol) and then the aminoethylmaleimide 
(32mg, 0.18mmol) was added and the reaction allowed to stir for an 
additional 2 hours. The reaction was diluted with diethyl ether (50ml) and 
the solvents decanted to leave a pink residue. Flash column 
chromatography (silica: DCM-40% methanol gradient) afforded the 



25 desired maleimide product (24mg, 43%). UV/VIS (MeOH); absorption A-max 
= 550nm. 5h (300MHz, CDsOD) 8.49 (t, 1H), 7.52-7.35 (m, 8H), 6.72 
(s, 2H), 6.46 (dd, 2H), 4.09 (m, 4H), 3.57 (m, 2H), 2.85 (m, 2H), 2.15 
(t, 2H), 1.94-1.43 (m, 10H) and 1.85 (s, 12H). 



5 



1-(5-Carboxypentyl)-2 f 3 / 3-trimethyl indolium bromide (500mg, 
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xvi) 1-(5-{[2-(2 / 5-Dioxo-2 / 5-dihvdro-1/y-pyrrol-1-vl)ethvl]amino)-6- 
oxopentvl)-3,3-dimethyl-2-[(1E,3£,5£)-5-(3^ 

1 ,3-dihydro-2//-indol-2-ylidene)penta-1 ,3-dienyl]-3//-indolium (Compound 
VII) 



1-(4-Carboxybutyl)-2,3,3-trimethyl indolium bromide (1.0g, 
2.94mmol) and malonaldehyde bisphenylimine hydrochloride (760mg, 
2.94mmol) were dissolved in acetic acid (10ml) then heated at 1 20°C for 
18 hours. The reaction was then allowed to cool before the acetic acid 

10 was removed in vacuo afford a mobile red oil. This was dissolved in 
chloroform and washed with water, dried with magnesium sulphate, 
filtered and concentrated in vacuo to afford a more viscous red/brown oil. 
The compound was purified using flash column chromatography 
(dichloromethane-20% methanol gradient) to afford an orange solid 

15 (1.43g, 55%). 

To a solution of the Cy5 half-dye (120mg, 0.26mmol) in anhydrous 
pyridine (5ml) was added acetic anhydride (0.5ml) and the reaction stirred 
under a nitrogen atmosphere for 10 minutes. After this time 1- 
20 suIphobutyl-2,3,3-trimethylindolium iodide (83mg, 0.28mmol) was added 
and the reaction stirred at room temperature for 2 hours. The solvent 
was removed under reduced pressure and purified using flash column 
chromatography (silica: dichloromethane-20% methanol gradient) to yield 
78 mg of desired product (51 %). 



The Cy5 acid (50mg, 0.09mmol) was dissolved in anhydrous DMF 
under a nitrogen atmosphere then stirred at room temperature. DIPEA 
(15|il, O.IOmmol) and TSTU (28mg, O.IOmmol) were added and the 
reaction stirred for 2 hours until deemed complete to the NHS ester by tic 
30 (20% MeOH/80% DCM). The NHS ester was treated with a second 
equivalent of DIPEA (15jil, O.IOmmol) and then aminoethylmaleimide 
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(30mg, 0.17mmol) was added and the reaction allowed to stir for an 
additional 2 hours. Tic analysis showed conversion to a new product. 
The reaction was diluted with diethyl ether (50ml) and the solvents 
decanted to leave a blue residue. Flash column chromatography (silica: 
DCM-40% methanol gradient) afforded the desired maleimide product 
(30mg, 30%). UV/VIS (MeOH); absorption Xmax = 644nm. Sh (300MHz, 
CDsOD) 8.19 (t, 2H), 7.52-7.42 (m, 8H), 6.71 ( S/ 2H), 6.64 (t, 1H), 6.36 
(dt, 2H), 4.18 (m, 4H), 3.55 (m, 2H), 2.89 (m, 2H), 2.15 (t, 2H), 1.79 (s, 
12H) and 2.01-1.36 (t, 8H). 

xvii) 1 -(6'{[3-(2 / 5-Dioxo-2 / 5-dihydro-1 //-pyrrol-1 -yl)propyl]amino}-6" 
oxohexyD^-KI^Sa-S-d-ethyl-S^-dimethvl-S-sulpho-l^-dihydro^/y- 
indol-2-ylidene)prop-1 -enyll-S^-dimethyl-S/y-indolium (Compound VIII) 

1-(5-Carboxypentyl)-2,3,3-trimethyl indolium bromide (354mg, 
1 .OOmmol), N^'-diphenylformamidine (196mg, 1 .OOmmol) and 1-ethyl- 
2 / 3 / 3-trimethyl-5-sulphonyI-indolium iodide (267mg, 1 .OOmmol) were 
dissolved in anhydrous pyridine (15ml) and stirred at room temperature. 
Acetic anhydride (0.5ml) was then added and the reaction stirred 
overnight at ambient temperature. The pyridine was removed in vacuo 
and the magenta oil purified by flash column chromatography 
(dichloromethane-40% methanol gradient) to afford a pink solid (30mg, 
6%). 

The Cy3 acid (30mg, 0.05mmol) was dissolved in anhydrous DMF 
under a nitrogen atmosphere then stirred at room temperature. DIPEA 
(10^1, 0.05mmol) and TSTU (2mg, 0.05mmol) were added and the 
reaction stirred for 2 hours until deemed complete to the NHS ester by tic 
(20% MeOH/80% DCM). The NHS ester was treated with a second 
equivalent of DIPEA (10|J, 0.05mmol) and then the aminoethylmaleimide 
(9mg, 0.05mmol) was added and the reaction allowed to stir for an 
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additional 2 hours. The reaction was diluted with diethyl ether (20ml) and 
the solvents decanted to leave a pink residue. Flash column 
chromatography (silica: DCM-40% methanol gradient) afforded the 
desired maleimide product (15mg, 44%). UV/VIS (MeOH); absorption JUax 
= 550nm. 8h (300MHz, CDaOD) 8.53 (t, 1H), 7.95 (m, 2H), 7.59-7.41 
(m, 5H), 6.78 ( S/ 2H), 6.50 (t, 2H), 4.17 (m, 4H), 3.50 (m, 2H), 3.14 (t, 
2H), 2.14 (t, 2H), 1.98-1.28 (m, 1 1H) and 1.79 (s, 12H). 



xviii) 1 -(6-([3-(2,5-Dioxo-2,5-dihydro-1 rt-pyrrol-1 -yl)propyl]amino)-6- 
10 oxohexyl)-2-[(1£ f 3£)-3-(3,3-dimethyl(1-sulpho-butvl)-1,3-dihydro-2/y-indol- 
2-ylidene)prop-1-enyl]-3 f 3-dimethyl-3/Y-indolium (Compound IX) 

1-(5-CarboxypentyI)-2,3,3-trimethyl indolium bromide (6.0g, 
16.95mmol) and N,N'-diphenylformamidine (6.63g, 33.87mmol) were 
15 dissolved in acetic acid (20ml) then heated at 1 20°C for 5 hours. The 
acetic acid was removed in vacuo and the residue purified using column 
chromatography (silica: dichloromethane-20% methanol gradient) 
yellow/orange solid (1 .34g, 21 %). 



20 To a solution of the Cy3 half-dye (250mg, 0.55mmol) in anhydrous 

pyridine (5ml) was added acetic anhydride (0.5ml) and the reaction stirred 
under a nitrogen atmosphere for 10 minutes. After this time 1- 
sulphobutyI-2,3,3-trimethylindolium iodide (161mg, 0.55mmol) was 
added and the reaction stirred at room temperature for 22 hours. The 

25 solvent was removed under reduced pressure and purified using flash 

column chromatography (silica: dichloromethane-20% methanol gradient) 
to yield 1 42mg of the desired Cy3 acid product (45%). 



30 



Cy3 acid (50mg, 0.09mmol) was dissolved in anhydrous DMF 
under a nitrogen atmosphere then stirred at room temperature. DIPEA 
(15|il, O.IOmmol) and TSTU (28mg, O.IOmmol) were added and the 
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reaction stirred for 2 hours until deemed complete to the NHS ester by tic 
(20% MeOH/80% DCM). The NHS ester was treated with a second 
equivalent of DIPEA (15^.1, O.IOmmol) and aminopropylmaleimide (18mg, 
O.IOmmol) were added and the reaction allowed to stir for 2 hours. The 

5 reaction was diluted with diethyl ether (50ml) and the solvents decanted 
to leave a pink residue. Flash column chromatography (silica: DCM-40% 
methanol gradient) afforded the desired maleimide product (15mg, 24%). 
5h (300MHz, CD3OD) 8.49 (t, 1H), 7.78-7.22 (m, 8H), 6.72 (s, 2H), 6.50 
(t, 2H), 4.12 (m, 4H), 3.47 (m, 2H), 3.10 (m, 2H), 2.85 (m, 2H), 2.15 (t, 

10 2H), 2.01-1.24 (m, 12H) and 1.85 (s, 12H). 



xix) 1 -(6-{[2-(2,5-Dioxo-2,5-dihydro-1 rt-pyrrol-1 -yl)ethyl]amino}-6- 
oxohexyD-S.S-dimethyl^-tdE.SE.S^-S-O.S-dimethyl-d-sulpho-butyD-l ,3- 
dihydro-2/7-indol-2-ylidene)penta-1 ,3-dienyl]-3//-indolium (Compound X) 

15 

1-(5-Carboxypentyl)-2 / 3 / 3-trimethyI indolium bromide (353mg, 
1 .OOmmol), malonaldehyde bisphenylimine hydrochloride (259mg, 
1 .OOmmol) and 1 -sulphobutyl-2,3,3-trimethylindolium iodide (295mg, 
1 .OOmmol) were dissolved in anhydrous pyridine (5ml) and stirred at room 
20 temperature. Acetic anhydride (0.4ml) was then added and the reaction 
stirred overnight at ambient temperature. The pyridine was removed in 
vacuo and the blue oil purified by flash column chromatography 
(dichloromethane-1 0% methanol gradient) to afford a blue solid (54mg, 
8%). 

25 

The Cy5 acid (49mg, 0.08mmol) was dissolved in anhydrous DMF 
under a nitrogen atmosphere then stirred at room temperature. DIPEA 
(12(^1, 0.09mmol) and TSTU (27mg, 0.09mmol) were added and the 
reaction stirred for 2 hours until deemed complete to the NHS ester by tic 
30 ( 20% MeOH/80% DCM). The NHS ester was treated with a second 
equivalent of DIPEA (1 2jjlI, 0.09mmol) and then aminoethylmaleimide 
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(28mg, 0.16mmol) was added and the reaction allowed to stir for an 
additional 2 hours. Tic showed conversion to a new product so the 
reaction was diluted with diethyl ether (50ml) and the solvents decanted 
to leave a pink residue. Flash column chromatography (silica: DCM-40% 
5 methanol gradient) afforded the desired maleimide product (24mg, 45%). 
UV/VIS (MeOH); absorption W = 644nm. Sh (300MHz, CDsOD) 8.21 (t, 
2H), 7.54-7.26 (m, 8H), 6.80 (s, 2H), 6.64 (t, 1H), 6.38 (dd, 2H), 4.18 
(m, 4H), 3.58 (m, 2H), 2.89 (m, 2H), 2.15 (t, 2H), 1.76 ( S/ 1 2H) and 
2.06-1.39 (m, 10H). 

10 

xx) 1-(6-{[2-(2 / 5-Dioxo-2 / 5-dihydro-1/y-pyrrol-1-yl)ethyl]amino}-6- 
oxohexyD^-td^Sa-S-d^.S-trimethvl-S-sulpho-l^-dihydro^/y-indol^- 
ylidene)prop-1-enyl]-3,3-dimethyl-3A/-indolium (Compound XI) 

15 By similar methods, 1 ,2,3,3-tetramethyl-5-sulphonyl-indolium iodide 

and 1 -(5-carboxypentyl)-2,3,3-trimethyl indolium bromide were reacted 
together with N^'-diphenylformamidine to form the Cy3 acid, which 
when activated to the N-hydroxysuccinimide ester and treated with 
aminoethylmaleimide, was converted to 1 -(6-{[2-(2,5-dioxo-2,5-dihydro- 

20 1/y-pyrrol-1-yl)ethyI]amino}-6-oxohexyl)-2-[(1£ f 3£)-3-(1 ,3,3-trimethyl-5- 
sulpho-1 ,3-dihydro-2A/-indol-2-ylidene)prop-1-enyl]-3,3-dimethyl-3A/- 
indolium. 

Example 2 

25 

2.1 Protein Isolation and Labelling 

Initial experiments were performed on E. coli samples. E.coli strain 
ER1647 (Amersham Biosciences, Buckinghamshire, UK) was grown in 
30 glucose rich MOPS media at 37°C overnight, followed by harvesting by 
centrifugation for 10 minutes at 4°C at 12,000 x g. The cell pellet was 
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washed twice with wash buffer (10mM Tris pH 8.0, 0.5mM magnesium 
acetate). Cells were then resuspended in lysis buffer (8M urea, 4% w/v 
CHAPS, 40mM Tris pH8.0) and lysed by sonication (3x10 second pulses 
on ice). The protein concentration of the E.coli lysate was determined 
using the Bio-Rad Dc Protein Assay as described by the manufacturer 
(Bio-Rad, Hertfordshire, UK). 

Before use, the cyanine dyes were reconstituted in anhydrous DMF 
(Aldrich catalogue code 22,705-6) to give a final concentration of lOmM 
(1 0nmol/|il). The dye was vortexed briefly after addition of DMF to 
ensure the dye was completely dissolved. The reconstituted dye was 
stored at -20°C and used within 2 weeks. 

Disulphide bonds in 50p.g protein in lysis buffer were reduced by 
the addition of 10nmol TCEP [tris-(2-carboxyethyl)phosphine] followed by 
incubation at 37°C for 1 hour. Following reduction, 20nmol reconstituted 
dye was added and incubated for 30 minutes at 37°C. The reaction was 
stopped with an equal volume of 2x sample buffer (Lysis buffer plus 
20mg / ml DTT and 4%(v/v) Pharmalytes 3-10). The samples were 
stored briefly on ice prior to first dimension separation or frozen protected 
from light for longer storage. 

2.2 Preparation of protein samples for separation. 



Equal amounts of protein samples labelled with Cy3 and Cy5 
compounds I - XI were mixed to give the dye pair sets shown in Table 2. 
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Table 2 



Dve Dair set 


Comoound 


Dve Dair set 


Compound 


1 


|| 


7 


VIII 
II 


2 


III 

IV ' 


8 


IX 
VII 


3 


v 


9 


III 

|| 


4 


VI 
VII 


10 


VI 

X 


5 


VIII 
IV 


11 


I 

IV 


6 


IX 

X 


12 


XI 

II 



2.3 Protein Separation by 2D Electrophoresis 



5 2-D electrophoresis was performed using standard Amersham 

Biosciences 2D PAGE equipment and PlusOne™ reagents 
(Buckinghamshire, UK). Immobiline DryStrips (pH3-10 NL or pH 4-7 NL, 
24cm) were rehydrated overnight in 450|al rehydration buffer (8M urea, 
4% w/v CHAPS, 1% Pharmalytes (pH 3-10), 2mg/ml DTT) overlaid with 

10 2.5ml DryStrip Cover Fluid, in an Immobiline DryStrip Reswelling Tray. 
Strips were focused using the Multiphor isoelectric focusing system. 
Prior to 2 nd dimension PAGE, each strip was equilibrated with 10ml 
equilibration buffer A (8M urea, 100mM Tris-HCI pH6.8, 30% v/v 
glycerol, 1 % w/v SDS, 5mg/ml DTT) on a rocking table for 10 minutes, 

15 followed by 10ml equilibration buffer B (8M urea, 100mM Tris-HCI pH6.8, 
30% v/v glycerol, 1 % w/v SDS, 45mg/ml iodoacetamide) for a further 10 
minutes. The strips were then loaded and run on 12% isochratic Laemmli 
SDS-PAGE gels. 
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2.4 Fluorescence Gel Imaging 



Labelled proteins were visualised using the 2920 Masterlmager 
5 (Amersham Biosciences, Buckinghamshire, UK) with the following 
settings: 





Excitation 


Emission 


Cy3 


540nm 
(25nmBP) 


590nm 
(35nmBP) 


Cy5 


620nm 
(30nmBP) 


680nm 
(30nmBP) 



Exposure times were optimised for individual experiments to give a 
maximum pixel value on the image of 50,000 to avoid saturation of the 
10 signal. Data from 2D-Master was exported as TIF files into PaintShop 
Pro™ to generate colour overlays for visual inspection. For detailed 
quantitative analysis of dye matching, gel images were exported into 2D 
Image Master software programme. 



15 2.5 Image Analysis 



Gel analysis in this study was performed using 2D Image Master 
(Amersham Biosciences, Buckinghamshire, UK) a 2-D analysis software 
platform. Following spot detection, the centre of each spot was used to 

20 generate pixel co-ordinates for each of the Cy3 and Cy5 images. The 
extent of migration matching was determined by comparing the co- 
ordinate positions of the centres of the Cy3 and Cy5 spots. This can be 
used to describe positional matching in terms of the number of matched 
spots within ± 2 pixels of each other in both the x (pi) and y (mass) co- 

25 ordinates. 
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As both dyes have been used to label the same sample the same 
proteins are present in each labelled sample and thus if the dyes are 
matched for equivalent migration all of the spots should exactly overlay 
between the 2 images. The % of spots within +1-2 pixels was 
5 measured to determine the overall matching. 

Example 3 Difference Gel Electrophoresis of Dyes Matched for 
Saturation Labelling on Cysteine Residues by Variation of the Alkyl Chain 
Length (r) 



Dye structure effects were tested by labelling the same complex 
protein sample (E.coli lysate) with each dye, mixing the Cy3 and Cy5 
labelled samples and separating by 2D electrophoresis. Following 
fluorescence scannning, the images were overlayed and analysed as 
15 described in the above example. 

E.coli lysate was labelled with Cy3 compounds and Cy5 
compounds as described. The labelled samples were mixed prior to 
separation to give dye sets 1 , 2, 9, 1 1 and 1 2 to show the effect of 
20 varying the length of the alkyl chain by the addition of CH2 units. Table 3 
shows the quantitative positional data following analysis of the overlaid 
images of protein labelled with dye sets 1, 2, 9, 1 1 and 12. 



10 



25 



30 
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Table 3: Effect of Varying Alkyl Chain Length (r) on Positional 
Matching 



Dve Set 


Value of n 


p 


q 


r 


Matched Spots within 2 












Pixels (%) 














X (ol) 


Y (mass) 


1 


I tuyo; 


O 




9 


82.7 


88.8 




2 (Cv5) 


5 


2 








2 




c 
O 


o 
z. 


q 
o 


79.7 


77.7 




2 (Cy5) 


5 


2 


2 






9 


1 (Cy3) 


5 


2 


3 


76 


76.3 




2 (Cy5) 


5 


2 


1 






1 1 


1 (Cy3) 


5 


2 


2 


86.1 


44.3 




2 (Cy5) 


5 


2 


2 






12 


1 (Cy3) 


5 


2 


1 


59.7 


58.9 




2 (Cy5) 


5 


2 


1 







Thus, when the Cy3 and Cy5 have equal alkyl chains (e.g. dye sets 
1 1 and 12) the overall mass of Cy3 is decreased relative to Cy5. This 
results in poor positional matching in the mass dimension. When there is 
a two carbon unit difference in the alkyl chain length (e.g. dye set 9), 
mass matching improves. Optimum matching for both mass and pi is 
obtained when there is a single carbon atom difference between the dyes 
(e.g. dye set 1 and 2). The preferred dye set is dye set 1 . This shows 
that migration matching using a single carbon unit difference in the linker 
r gives good positional matching of differentially labelled proteins on 2D 
electrophoresis. 



Figure 2 shows overlay images of proteins labelled with dye sets 1 , 
1 2 and 9 and separated by 2D electrophoresis with outlines of labelled 
protein spots to demonstrate the positional matching. The overlays are 
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taken from a portion of a 2D electrophoresis gel of E.coli lysate labelled 
with Cy3 or Cy5 showing the — 20 - 30kDa mass range and — 5.5 - 6.0 
pi range. The circles representing the outline positions of the protein 
spots for both the Cy3 and Cy5 labelled proteins (arrowed) were 

5 determined using the 2D ImageMaster software. Dye set 1 2 gives poor 
mass matching as the Cy5 labelled proteins run below the Cy3 labelled 
proteins (example arrowed). Dye set 9 also shows poor mass matching, 
in this case with the Cy5 labelled proteins running above the Cy3 labelled 
proteins (example arrowed). Preferred dye set 1 gives the optimal 

10 positional matching. 



Example 4 Difference Gel Electrophoresis of Dyes Matched for 
Saturation Labelling on Cysteine Residues by Variation of the Length of 
the Linker to the Indole Nucleus (p) 

E.coli lysate was labelled with Cy3 and Cy5 compounds and 
samples mixed to give dye sets 1 and 3 as described. Table 4 shows the 
quantitative positional data following analysis of the overlaid images of 
protein labelled with these dye sets. 

Table 4: Effect of Varying Linker Length (p) on Positional Matching 



Dye Set 


Value of n 


P 


q 


r 


Matched Spots within 2 












Pixels (%) 














X (pi) 


Y (mass) 


1 


1 (Cy3) 


5 


2 


2 


87.3 


86.4 




2 (Cy5) 


5 


2 


1 






3 


1 (Cy3) 


5 


2 


2 


88.1 


85.2 




2 (Cy5) 


4 


2 


2 
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This shows that migration matching using a single carbon unit 
difference in the linker p gives good positional matching of differentially 
labelled proteins on 2D electrophoresis. 

Example 5 Difference Gel Electrophoresis of Dyes Matched for 
Saturation Labelling on Cysteine Residues by Variation of the Length of 
the Linker to the Maleimide Group (q) 



Variation of the linker q was achieved by increasing the length of 
10 the linker by a single CH2 unit from an aminoethyl to aminopropyl 

maleimide. E.coli lysate was labelled with Cy3 and Cy5 compounds and 
samples mixed to give dye sets 1, 5 and 7 as described. Table 5 shows 
the quantitative positional data following analysis of the overlaid images 
of protein labelled with these dye sets. 

15 

Table 5: Effect of Varying Maleimide Linker Length (q) on Positional 
Matching 



Dye Set 


Value of n 


P 


q 


r 


Matched Spots within 2 












Pixels (%) 














X (pi) 


Y (mass) 


1 


1 (Cy3) 


5 


2 


2 


85.7 


88.0 




2 (Cy5) 


5 


2 


1 






5 


1 (Cy3) 


5 


3 


2 


64.2 


80.7 




2 (Cy5) 


5 


2 


2 






7 


1 (Cy3) 


5 


3 


2 


67.6 


71.8 




2 (Cy5) 


5 


2 


1 







20 



Thus, the aminoethyl linker (dye set 1) has improved pi matching 
compared to the aminopropyl linker (dyes sets 5 and 7) possibly due to 
the p-effect. When there is a two carbon unit difference between the 
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dyes the mass matching also decreases {dye set 7). When there is a 
single carbon unit difference mass matching improves. Thus, an 
aminoethyl maleimide linker (q = 2) is preferred for pi matching. 



5 Example 6 Difference Gel Electrophoresis of Dyes Matched for 

Saturation Labelling on Cysteine Residues by Variation of the Position of 
the Sulphonate Group 

Sulphonated cyanine dyes generally have the sulphonate group 
10 directly attached to the indole ring as in dye set 1 . Variation of the 

sulphonate position was achieved by attaching the sulphonate to the ring 
via a butyl chain linker. E.coli lysate was labelled with Cy3 and Cy5 
compounds and samples mixed to give dye sets 1 , 4, 6, 8 and 10 as 
described. Table 6 shows the quantitative positional data following 
15 analysis of the overlaid images of protein labelled with these dye sets. 



Table 6: Effect of Varying Sulphonate Position on Matching 



Dye 


Sulphonate 


Value of n 


P 


q 


r 


Matched Spots within 


Set 


Position 










2 Pixels (%) 














X (pi) 


Y (mass) 


6 


Pendant 


1 (Cy3) 


5 


3 


4 


86.3 


82.6 




Pendant 


2 (Cy5) 


5 


2 


4 






4 


Pendant 


1 (Cy3) 


5 


2 


4 


78.3 


79.7 




Pendant 


2 (Cy5) 


4 


2 


4 






8 


Pendant 


1 (Cy3) 


5 


3 


4 


73.3 


63.7 




Pendant 


2 (Cy5) 


4 


2 


4 






10 


Pendant 


1 (Cy3) 


5 


2 


4 


82.4 


49.8 




Pendant 


2 (Cy5) 


5 


2 


4 
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Comparison with previous data shows that the removal of the 
sulphonate from the ring system to a pendant butyl chain does appear to 
change the pi matching. The matching in the mass dimension is no better 
than that achieved with dye set 1 . Where there is no compensation for 

5 the difference n between the dyes the migration matching decreases (dye 
set 10). When there is a two carbon unit difference (dye set 8), mass 
matching is decreased compared to dye set 1 . When there is a single 
carbon unit difference (dye sets 4 and 6) matching is improved. The best 
combination with a pendant butyl sulphonate is to modify the linker q to 

10 the maleimide (dye set 6). 
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Claims 



1 . A matched set of fluorescent dyes comprising at least two different 
fluorescent dyes of formula (I): 

5 




(I) 

wherein n is different for each said dye and is 1 , 2, or 3; 
Z 1 and Z 2 independently represent the carbon atoms necessary to 
complete a phenyl or naphthyl ring system; 
15 one of groups R 1 and R 2 is the group: 

O 
II 

— (CH2) p C— NH (CH2) q Y 

where Y is a target bonding group; 
20 remaining group R 1 or R 2 is selected from -(CH2)4-W or _(CH 2 )r-H; 

group R 3 is hydrogen, except when either R 1 or R 2 is _(CH 2 )r-H, in which 
case R 3 is W; 

W is selected from sulphonic acid and sulphonate; 
p is an integer from 3 to 6; 
25 q is selected to be 2 or 3; and 
r is an integer from 1 to 5; 
and their salts; 

characterised in that when n of two of said dyes differs by + 1 , one of p ? 
q and r of said two dyes differs by -1 . 
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2. A matched set of fluorescent dyes comprising at least two different 
fluorescent dyes of formula (II): 




(II) 

wherein n is different for each said dye and is 1 , 2, or 3; 
one of groups R 1 and R 2 is the group: 



— (CH2) p C— NH — (CH2) q — Y 

where Y is a target bonding group; 

remaining group R 1 or R 2 is selected from -(CH 2 )4-W or -(CH 2 )r-H; 
group R 3 is hydrogen, except when either R 1 or R 2 is -(CH 2 )r~H, in which 
case R 3 is W; 

W is selected from sulphonic acid and sulphonate; 
p is an integer from 3 to 6; 
q is selected to be 2 or 3; and 
r is an integer from 1 to 5; 
and their salts; 

characterised in that when n of two of said dyes differs by + 1 , one of p, 
q and r of said two dyes differs by -1 . 

3. A matched set according to claim 1 or claim 2 comprising at least 
two different fluorescent dyes according to formula (I) or (II) in which: 
n is selected to be 1 or 2; 
p is selected to be 4 or 5; 



WO 2004/005933 ^fcCT/GB2002/003142 

-47- 

q is selected to be 2 or 3; and 
r is selected to be 1 , 2 or 3. 



4. A matched set according to any of claims 1 to 3 wherein said 

5 target bonding group Y in each dye of the set of dyes is the same and is 
selected from a maleimido group and an iodoacetamido group. 

5. A matched set according to claim 4 wherein in each said dye Y is a 
maleimido group. 

o 

6. A matched set according to any of claims 1 to 5 wherein said salts 
are selected from K + , Na + , NH4 + , R 3 NH + and FUN + where R is Ci to C 4 
alkyl. 



15 7. A matched set of dyes according to any of claims 1 to 6 selected 
from: 



Set 1 

20 1-(6-{[2-(2 / 5-dioxo-2 / 5-dihydro-1A/-pyrrol-1-yl)ethyl]amino}-6-oxohexyl)-2- 
[dE^a-S-d-ethyl-S^-dimethyl-S-sulpho-l^-dihydro^Ay-indol^- 
ylidene)prop-1-enyl]-3,3-dimethyl-3f/-indolium (Compound I); and 
1 -(6-{[2-(2,5-dioxo-2,5-dihydro-1 /V-pyrrol-1 -yl)ethyl]amino}-6-oxohexyl)- 
S^-dimethyl^-fdE^E^a-S-d^^-trimethyl-S-sulpho-l ,3-dihydro-2//- 

25 indol-2-y!idene)penta-1 ,3-dienyl]-3f/-indolium (Compound II); 



Set 2 



30 



1-(6-{[2-{2,5-dioxo-2,5-dihydro-1/y-pyrrol-1-yl)ethyl]amino}-6-oxohexyl)-2- 
[(1 £,3£)-3-(1 -propyl-3,3-dimethyl-5-sulpho-1 ,3-dihydro-2A/-indol-2- 
ylidene)prop-1-enyl]-3,3-dimethyl-3//-indolium (Compound III); and 
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1 -(6-{[2-(2,5<Jioxo-2,5-dihydro-1 //-pyrrol-1 -yl)ethyl]amino}-6-oxohexyl)- 
3,3-dimethyI-2-[(1£,3£,5tf-5-(lH^^ 

2//-indol-2-ylidene)penta-1,3-dienyl]-3^-indolium (Compound IV); 



Set 3 



1 -(6-{[2-(2,5-dioxo-2,5-dihyd^ 
[(1£,3£)-3-(1-ethyI-3,3<iimethyl^^^ 

ylidene)prop-1 -enyl]-3,3-dimethyI-3/y-indolium (Compound I); and 

1 -(5-{[2-(2,5-dioxo-2,5-dihydro-1 //-pyrrol-1 -yl)ethyl]amino}-6-oxopentyl)- 

S^-dimethyl^-ttl^SE^fJ-S^I-ethyl-S^-trimethyl-S-sulpho-l^-dihydro- 

2#-indol-2-ylidene)penta-1 ,3-dienyl]-3//-indolium (Compound V); 



Set 4 



1 -(6-{[2-(2,5-dioxo-2,5-dihydro-1 /Apyrrol-1 -yl)ethyl]amino}-6-oxohexyI)-2- 

[(I^Sa-S-tS^-dimethyld-sulpho-butyD-l^-dihydro^/y-indol^- 

ylidene)prop-1-enyl]-3,3<JimethyI-3A/-indolium (Compound VI); and 

1 -(5-{[2-(2 / 5-dioxo-2,5-dihydro-1 #-pyrrol-1 -yl)ethyl]amino}-6-oxopentyl)- 

3,3-dimethyl-2-[(1£,3£,5£)-5-(3,3-d^ 

2//-indol-2-ylidene)penta-1 ,3-dienyl]-3//-indolium (Compound VII). 



Set 5 



1 -(6-{[3-(2,5-dioxo-2,5-dihydro-l //-pyrrol-1 -yl)propyl]amino}-6-oxohexyl)- 
2-[( 1 F # 3£)-3-( 1 -ethyl-3^ 

ylidene)prop-1-enyl]-3,3-dimethyl-3/y-indolium (Compound VIII); and 

1 -(6-{[2-(2,5-dioxo-2,5-dihydro-1 //-pyrrol-1 -yl)ethyl]amino}-6-oxohexyl)- 

3,3-dimethyl-2-[(1£,3£,5£)-5-(1-ethy^^ 

2//-indol-2-ylidene)penta-1,3-dienyl]-3//-indolium (Compound IV); and 



WO 2004/005933 

Set 6 

1 -(6-{[3-(2,5-dioxo-2,5-dihydro-1 //-pyrrol-1 -yl)propyl]amino}-6-oxohexyl)- 
2-[(1£,3£)-3-(3,3<Jimethyl(1-sulpho4^^ 
5 ylidene)prop-1-enyl]-3,3-dimethyl-3AY-indolium (Compound IX); and 

1 -(6-{[2-(2,5-dioxo-2,5-dihydro-1 tf-pyrrol-1 -yl)ethyl]amino}-6-oxohexyl)- 
S^-dimethyl^-tdE^E^^-S-O^-dimethylHI-sulpho-butyD-l ,3-dihydro- 
2//-indol-2-ylidene)penta-1 ,3-dienyl]-3//-indoIium (Compound X). 

8. A method for labelling a mixture of proteins in a sample wherein 
each of said proteins contains one or more cysteine residues, said method 
comprising: 

i) adding to an aqueous liquid containing said sample a fluorescent 
dye wherein said dye contains a target bonding group that is covalently 
reactive with said proteins; and 

ii) reacting said dye with said proteins so that said dye labels said 
proteins; 

characterised in that all available cysteine residues in said proteins are 
labelled with said dye. 

9. A method according to claim 8 wherein said fluorescent dye is a 
cyanine dye. 

10. A method according to claim 9 wherein said cyanine dye contains a 
25 sulphonic acid or sulphonate group. 

11. A method according to any of claims 8 to 10 wherein said target 
bonding group is selected from a maleimido group and an iodoacetamido 
group. 
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12. A method according to claim 8 further comprising prior to step i), 
the step of treating the protein with a reductant. 

13. A method according to claim 8 wherein said dye is used in a range 
5 of 5 to 200nmol of dye per 50ug of protein. 

14. A method according to claim 8 wherein said labelling is performed 
at a pH in the range from 6.0 to 9.0. 

io 15. A method for labelling one or more proteins in a sample, the 
method comprising: 

i) adding to a liquid sample containing said one or more proteins a 
fluorescent dye of formula (l): 



15 




20 (") 

wherein n is different for each said dye and is 1 , 2, or 3; 
Z 1 and Z 2 independently represent the carbon atoms necessary to 
complete a phenyl or naphthyl ring system; 
one of groups R 1 and R 2 is the group: 
25 O 

— (CH2) p C— NH (CH2) q Y 

where Y is a target bonding group; 

remaining group R 1 or R 2 is selected from -(CH2>4-W or -(CH2>r-H; 
30 group R 3 is hydrogen, except when either R 1 or R 2 is -(CH 2 )r-H, in which 
case R 3 is W; 
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W is selected from sulphonic acid and sulphonate; 
p is an integer from 3 to 6; 
q is selected to be 2 or 3; and 
r is an integer from 1 to 5; 
5 and their salts; 

characterised in that when n of two of said dyes differs by + 1 , one of p, 
q and r of said two dyes differs by -1 ; and 

ii) incubating said dye with said sample under conditions suitable for 
labelling said one or more proteins. 

10 

16. A method according to claim 15 wherein each of Z 1 and Z 2 
represents the carbon atoms necessary to complete a phenyl ring system. 

17. A method according to claim 15 or claim 16 wherein: 
15 n is selected to be 1 or 2; 

p is selected to be 4 or 5; 

q is selected to be 2 or 3; and 

r is selected to be 1, 2 or 3. 

20 18. A method according to any of claims 1 5 to 17 wherein said target 
bonding group Y is selected from a maleimido group and an 
iodoacetamido group. 

19. A kit comprising a matched set of fluorescent dyes comprising at 
25 least two different fluorescent dyes having the formula (I): 
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(0 

wherein n is different for each said dye and is 1 , 2, or 3; 
Z 1 and Z 2 independently represent the carbon atoms necessary to 
complete a phenyl or naphthyl ring system; 
io one of groups R 1 and R 2 is the group: 

O 

— (CH^p C— NH (CH^q — Y 

15 where Y is a target bonding group; 

remaining group R 1 or R 2 is selected from -(CH 2 )4-W or -(CH 2 )r-H; 
group R 3 is hydrogen, except when either R 1 or R 2 is -(CH 2 )r-H, in which 
case R 3 is W; 

W is selected from sulphonic acid and sulphonate; 
20 p is an integer from 3 to 6; 
q is selected to be 2 or 3; and 
r is an integer from 1 to 5; 
and their salts; 

characterised in that when n of two of said dyes differs by + 1 , one of p, 
25 q and r of said two dyes differs by -1 . 
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Schematic Diagram of Example Workflow 
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Figure 2/2 

Overlay Images of Proteins Labelled with Dve Sets 1.12 and 9 
and Separated bv 2D Electrophoresis with Outlines of Labelled 
Protein Spots to Demonstrate Positional Matching 

Overlay of Preferred Dve Set 1 
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